Skew detection and correction in document images bsed on straight-line fitting
نویسندگان
چکیده
During document scanning, skew is inevitably introduced into the incoming document image. Since the algorithms for layout analysis and character recognition are generally very sensitive to the page skew, skew detection and correction in document images are the critical steps before layout analysis. In this paper, a novel skew detection method based on straight-line fitting is proposed. And a concept of eigen-point is introduced. After the relations between the neighboring eigen-points in every text line within a suitable sub-region were analyzed, the eigen-points most possibly laid on the baselines are selected as samples for the straight-line fitting. The average of these baseline directions is computed, which corresponds to the degree of skew of the whole document image. Then a fast skew correction method based on the scanning line model is also presented. Experiments prove that the proposed approaches are fast and accurate.
منابع مشابه
Document Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)
Document images produced by scanner or digital camera, usually suffer from geometric and photometric distortions. Both of them deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...
متن کاملGeometric Correction for Braille Document Images
Braille system has been used by the visually impaired people for reading.The shortage of Braille books has caused a need for conversion of Braille to text. This paper addresses the geometric correction of a Braille document images. Due to the standard measurement of the Braille cells, identification of Braille characters could be achieved by simple cell overlapping procedure. The standard measu...
متن کاملSkew and Slant Correction for Document Images Using Gradient Direction - Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
A fast algorithm is presented in this pa;per for skew and slant correction in printed document images. The algorithm employs only the gradient information. The skew angle is obtained by searching for a peak in the histogram of the gradient orientation of the input greylevel image. The skewness of the document is corrected by a rotation at such an angle. The slant of characters can also be detec...
متن کاملImproved Skew Detection and Correction Approach Using Discrete Fourier Algorithm
The main objective of Image processing is to convert an image into digital form and perform some operations on it, in order to get an enhanced image or to extract some useful information from it. But when they are needed to be converted into electronic form, it has to be done through scanning. One of the major problems in this field is that if the document to be read is not placed at 90. This w...
متن کاملSkew Estimation in Document Images Based on an Energy Minimization Framework
Skew estimation is important for document analysis and application. Most existing methods are proposed to deal with the document images consisting of words. In most cases, a complex document may include tables, irregular pictures and other non-text components. To address the challenging problem, this paper proposes a novel skew estimation approach based on an energy minimization framework for s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pattern Recognition Letters
دوره 24 شماره
صفحات -
تاریخ انتشار 2003